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SOURCE CODING ENHANCEMENT USING SPECTRAL-BAND REPLICATION 
TECHNICAL FIELD 

In source coding systems, digital data is compressed before transmission or storage to reduce Uie required bitrate or 
storing capacity. The present invention relates to a new method and apparatus for the improvement of source coding 
systems by means of Spectral Band Replication (SBR). Substantial bitrate reduction is achieved while maintaining 
the same perceptual quality or conversely, an improvement in perceptual quality is achieved at a given bitrate. This 
is accomplished by means of spectral bandwidth reduction at the encoder side and subsequent spectral band 
replication at the decoder, whereby the invention exploits new concepts of signal redundancy in the spectral domain. 

BACKGROUND OF THE INVENTION 

Audio source coding techniques can be divided into two classes: nahiral audio coding and speech coding Natural 
audio coding is commonly used for music or arbitiary signals at medium bitrates, and generally offers wide audio 
bandwidth. Speech coders are basically limited to speech reproduction bm can on tiie other hand be used at very low 
bhrates. albeit with low audio bandwidth. Wideband speech offers a major subjective quality improvemem over 
narrow band speech. Increasing flie bandwidtii not only improves intelUgibiUty and natuiataess of speech, but also 
facUitates speaker recognition. Wideband speech coding is thus an important issue in next generation telephone 
systems. Further, due to the timendous growth of the multimedia field, transmission of music and oUier non-speech 
signals at high quality over telephone systems is a desirable feature. 

A high-fideUly linear PCM signal is veiy inefficiem in terms of bitiate versus tiie perceptiial entropy. The CD 
standard dictates 44. 1 kHz sampling frequency. 16 bits per sample resolution and stereo. This equals a biUate of 
141 1 kbit/s. To drastically reduce Uie bitiate. source coding can be performed using split-band perceptual audio 
codecs. These nattiral audio codecs exploit perceptiial inelevancy and statistical redundancy in Uie signal. Using the 
best codec technology, approximately 90% data reduction can be achieved for a standard CD-format signal with 
practically no perceptible degradation. Very high sound quality in stereo is thus possible at around 96 kbit/s, i.e a 
compression factor of approximately 15:1. Some perceptiial codecs offer even higher compression ratios. To 
achieve this, it is common to reduce the sample-rate and tiius the audio bandwidth. It is also common to decrease Uie 
nmnber of quantization levels, allowing occasionally audible quantization distortion, and to employ degradation of 
Uie stereo field, Uuough intensity coding. Excessive use of such meUiods results in araioying perceptual degradation 
Current codec technology is near satiiration and furUier progress in coding gain is not expected. In order to improve 
the coding performance further, a new approach is necessary. 

The hmnan voice and most musical instiuments generate quasistationary signals Uiat emerge from oscillating 
systems. According to Fourier Uieory, any periodic signal may be expressed as a sum of sinusoids witfi Uie 
frequencies/ 2/ 3/ 4/ 5/etc. where/is Uie fimdamental frequency. The frequencies form a harmonic series A 
bandwidtii Dmitation of such a signal is equivalent to a tiuncation of tiie harmonic series. Such a truncation alters Uie 
perceived timbre, tone colour, of a musical instiument or voice, and yields an audio signal Uut will sound "muffled" 
or "duU". and intelligibiUty may be reduced. TTie high frequencies are tiius imponant for Uie subjective impression 
of sound quality. 
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Prior art methods are mainly intended for improvement of speech codec perfoimance and in particular intended for 
High Frequency Regeneration (HFR), an issue in speech coding. Such methods employ broadband linear frequency 
shifts, non-linearities or aliasing [U.S. Pat. 5^2.7,054] generating inten^^^^^ 

frequency components which cause severe dissonance when applied to music signals. Such dissonance is referred to 
in the speech coding literature as "harsh" and "rough" sounding. Other synthetic speech HFR methods generate 
sinusoidal harmonics that are based on fundamental pitch estimation and are thus limited to tonal stationary sounds 
[U.S. Pat. 4,771.465]. Such prior art methods, although useful for low-quality speech apphcaUons. do not work for 
high quality speech or music signals. A few methods attempt to improve the performance of high quaUty audio 
source codecs. One uses synthetic noise signals generated at the decoder to substihite noise-like signals in speech or 
music previously discarded by the encoder ["Improving Audio Codecs by Noise Substitution" D. Schultz, JAES, 
Vol. 44, No. 7/8, 1996]. This is peifoiroed within an othenvise normally transmitted highband at an inteimittent 
basis when noise signals are present. AnoUier method recreates some missmg highband harmonics that were lost in 
the coding process ["Audio Spectral Coder" A.J.S. Feireira, AES Preprint 4201, 100*' Convention, May 1 1-14 
1996, Copenhagen] and is again dependent on tonal signals and pitch deteaion. Both methods operate at a low duty- 
cycle basis offering comparatively limited coding or performance gain. 

SUMMARY OF THE INVENTION 

TTie present invention provides a new method and an apparatus for substantial improvements of digital source 
coding systems and more specifically for the improvements of audio codecs. The objective includes bitrate 
reduction or improved perceptiial quaUty or a combination thereof The invention is based on new methods 
exploiting harmonic redundancy, offering the possibility to discard passbands of a s.gnal prior to bansmission or 
storage. No perceptual degradation is perceived if the decoder performs high quality spectral replication according 
to the mvention. The discarded bits represent the coding gain at a fixed perceptual quality. Alternatively, more bits 
can be allocated for encoding of tiie lowband information at a fixed bitiate, thereby achievmg a higher perceptual 
quality. 



n.e presem invention postulates timt a truncated harmonic series can be extended based on U,e direct relation 
between lowband and highband spectral components. This extended series resembles the original in a perceptual 
sense rfcertam rules are followed: First, the extiapolated spectral components must be harmonically related to tiie 
bleated harmonic series, in order to avoid dissonance-related artifacts. The present invention uses tinnsposition as 
a means for the spectral rephcation process, which ensures that Ms criterion is met. It is however not necessary that 
the lowband spectral components form a harmonic series for successful operation, since new rephcated components 
harmomcally related to tiiose of the lowband, will not alter the noise-like or transiem natme of ttie signal A 
transposition is defined as a transfer of partials from one position to another on tiie musical scale while maintaining 
the fiequency ratios of the partials. Second, the spectral envelope, i.e. tiie coarse spectral distribution of the 
rephcated highband, must reasonably well resemble tltat of the original signal. TTie present invention offers two 
modes of operation, SBR-1 and SBR-2, that differ in the way the spectral envelope is adjusted. 

SBR-1, intended for the improvement of intennediate quality codec applications, is a single-ended process which 
rehes exclusively on the information contained in a received lowband or lo«p.ss signal at the decoder. TT»e spectial 
envelope of this signal is determined and extiapolated, for instance using polynomials together with a set of rules or 
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20 



a codebook. This information is used to continuously adjust and equalise the replicated highband. The present SBR- 
1 method offers the advantage of post-processing, i.e. no modifications are needed at the encoder side. A 
broadcaster will gain in channel utilisation or will be able to offer improved perceptual quality or a combinaUon of 
both. Existing bitstream syntax and standards can be used without modification. . 

SBR-2, intended for the improvement of high quality codec applications, is a double-ended process where, in 
addiUon to the transmitted lowband signal according to SBR-1, the spectral envelope of the highband is encoded and 
transmitted. Since the variations of the spectral envelope has a much lower rate than the highband signal 
components, only a limited amount of information needs to be transmitted in order to successfully represent the 
spectral envelope. SBR-2 can be used to improve the performance of current codec technologies with no or minor 
modifications of existing syntax or protocols, and as a valuable tool for future codec development. 



Both SBR-1 and SBR-2 can be used to replicate smaller passbands of \ht lowband when such bands are shut down 
by the encoder as stipulated by the psychoacoustic model under bit-starved conditions. This results in improvement 
15 of the perceptual quality by spectral replication within tlie lowband in addition to spectral replication outside the 
lowband. Further. SBR-1 and SBR-2 can also be used in codecs employing bitrate scalability, where the perceptual 
quality of the signal at the receiver varies depending on transmission channel conditions. This usually implies 
annoying variations of the audio bandwidth at the receiver. Under such condiUons, the SBR methods can be used 
successfully in order to maintain a constanUy high bandwidUi, again improving the perceptual quality. 



The present invention operates on a continuous basis, replicating any type of signal contents, i.e. tonal or non-tonal 
(noise-like and transient signals). In addition, the present spectral repUcaUon method creates a perceptually accurate 
repHca of the discarded bands from available frequency bands at the decoder. Hence, Uie SBR method offers a 
substantially higher level of coding gain or perceptual qualiti^ improvement compared to prior art methods. The 
25 invention can be combined with such prior art codec improvement methods; however, no performance gain is 
expected due to such combinations. 



The SBR-method comprises the following steps: 

- encoding of a signal derived from an original signal, where frequency bands of the signal are discarded and 
30 the discarding is performed prior to or during encoding, forming a first signal, 

- during or after decoding of the first signal, transposing frequency bands of the first signal, forming a second 
signal, 

- performing spectral envelope adjustment, and 

- combining the decoded signal and the second signal, forming an output signal. 



35 



40 



The passbands of the second signal may be set not to overlap or partly overiap the passbands of the first signal, and 
may be set in dependence of the temporal characteristics of the original signal and/or the first signal, or transmission 
channel condiUons. The spectral envelope adjustment is performed based on estimation of the original spectral 
envelope from said first signal or on transmitted envelope information of the original signal. 
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The present invention includes to basic types of tiansposers: mulUband tmnsposers and timc-variant pattern search 
prediction transposers, having different properties. A basic multiband transposition may be performed according to 
the present invention by the following: 

- filtering the signal to be transposed through a set of > 2 bandpass filters wiUi passbands comprising the 
frequencies Ifu^.^M respectively, forming bandpass signals, 

- shifting the bandpass signals in frequency to regions comprising the frequencies M [/i,...,/^J where 1 is 
the transposition factor, and 

- combining the shifted bandpass signals, forming the transposed signal. 

Alternatively, this basic multiband transposition may be performed according to the invention by the following: 

- bandpass filtering the signal to be transposed signal using an analysis filleitank or transform of such a nature 
that real- or complex-valued subband signals of lowpass type are generated, 

- an arbitrary number of channels k of said analysis filterbank or Uansform are comiected to chamiels Adk,M^ 
1, in a synthesis filterbank or transform, and 

- the transposed signal is formed using the synthesis filteitank or transform. 

An improved multiband transposition according to the invention incorporates phase adjustments, enhancing the 
performance of the basic multiband transposition. 

Tlie time-variant pattern search prediction transposition according to the present invention may be performed by the 
following: 

- performing transient detection on the first signal, 

- determining which segment of the first signal to be used when duplicating/discarding parts of tlie first signal 
depending on the outcome of the transient detection, 

- adjusting statevector and codebook properties depending on the outcome of the transient detection, and 

- searching for synchronisaUon points in chosen segment of the first signal, based on the synchronisaUon point 
found in the previous synchronisation point search 

The SBR meUiods and apparatuses according to the present invenUon offer the following features: 
1. The methods and apparatuses exploit new concepts of signal redundancy in the spectral domain. 
The methods and apparatuses are applicable on arbitrary signals. 
Each harmonic set is individually created and controlled. 

All replicated harmonics are generated in such a mamier as to form a continuation of the existing harmonic 
series. 

The spectral repUcation process is based on transposiUon and creates no or impercepUble artifacts. 

6. Tlie spectral repUcation can cover multiple smaller bands and/or a wide frequency range. 

7. In the SBR-l method, the processing is performed at the decoder side only. i.e. all standards and protocols 
can be used without modification, 

8. n,e SBR-2 method can be implemented in accordance with most standards and protocols with no or minor 

modifications. 

9. The SBR-2 method offers the codec designer a new powerfiil compression tool. 

10. The coding gain is significant. 



2. 
3, 
4. 

series, 

5. 
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The most attractive appUcaUon relates to the improvement of various types of low bitrate codecs, such as MPEG 1/2 
Uyer I/II/III [U.S. Pat. 5,040,217], MPEG 2/4 AAC, Dolby AC-2/3, NTT TwinVQ [U.S. Pat 5,684.920], 
AT&T/Lucent PAC etc. The invention is also usefiil in high-quality speech codecs such as wide-band CELP and 
SB-ADPCM 0.722 etc. to improve perceived quality. The above codecs are widely used in multimedia, in the 
telephone industry, on the Internet as well as in professional applications. T-DAB (Teirestiial Digital Audio 
Broadcasting) systems use low bitrate protocols that will gain in channel utUisation by using the present method, or 
improve quality in FM and AM DAB. Satellite S-DAB will gain considerably, due to the excessive system costs 
involved, by using the present method to inaease the number of programme channels in the DAB multiplex. 
Furthermore, for the first time, full bandwidth audio real-time streaming over the Internet is achievable using low 
bitrate telephone modems. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention wiU now be described by way of illustrative examples, not limiUng the scope or spirit of the 
invention, with reference to the accompanying drawings, in which: 

Fig. 1 illustrates SBR incorporated in a coding system according to die present invention; 

Fig. 2 iUustrates spectral repUcation of upper harmonics according to the present invention; 

Fig. 3 iUustrates spectral repUcation of inband harmonics according to the present invention; 

Fig. 4 is a block diagram for a time-domain implementation of a transposer according to the presem invention; 

Fig. 5 is a flow-chart representing a cycle of operation for the pattern-search prediction transposer according to the 

present invention; 

Fig. 6 is a flow-chart representing the search for synchronisation point according to the present invention; 
Fig. 7a - 7b iUusti^tes the codebook positioning during transients according to the present invention; 
Fig. 8 is a block diagram for an implementation of several time-domain tnmsposers in connection with a suitable 
filterbank, for SBR operation according to the present invention; 

Fig. Ja - 9c are block diagrams representing a device for STFT analysis and syntiiesis configured for generation 
of 2" order harmonics according to Uie present invention; 

Fig. 10a - 10b are block diagrams of one sub-band with a linear frequency shift in the STFT device according to 
the present invention; 

Fig. 1 1 shows one sub-band using a phase-multiplier according to the present invention; 

Fig. 12 iUustrates how 3^" order harmonics are generated according to the present invention; 

Fig. 13 iUustrates how 2"^ and 3"" order harmonics are generated simultaneously according to Uie present 

invention; 

Fig. 14 iUustrates generation of a non-overlapping combination of several harmonic orders according to the 
present invention; 

Fig. 15 iUustrates generation of an interleaved combination of several harmonic orders according to tlie presem 
invention; 

Fig, 16 illustrates generation of broadband linear frequency shifts; 

Fig. 17 iUustrates how sub-haimonics are generated according to the present invention; 

Fig. 18a - 18b are block diagrams of a perceptual codec; 

Fig. 19 shows a basic stnicture of a maximally decimated filteibank; 
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Fig. 20 iUustrates generation of 2"" order harmonics in a maximally decimated filterbank according to the present 
invention; 

Fig. 2 1 is a block diagram for the improved multiband transposition in a maximally decimated filterbank 
operating on subband signals according to the present invention; 

Fig. 22 is a flow-chart representing the improved multiband transposition in a maximaUy decimated filterbank 
operating on subband signals according to the present invention; 
Fig. 23 illustrates subband samples and scalefectors of a typical codec; 

Fig. 24 illustrates subband samples and envelope information for SBR-2 accoiding to Uie presem invention; 
Fig. 25 iUustrates hidden transmission of envelope information in SBR-2 according to tlie piesem invention- 
Fig. 26 iUustrates redundancy coding in SBR-2 according to the presem invention; 

Fig. 27 iUustrates an implementation of a codec using the SBR-1 method according to the present invention; and 
Fig. 28 iUustrates an implementation of a coded using the SBR-2 method according to the presem invention'. 
Fig. 29 is a block diagram of a "pseudo-stereo" generator according to the present invention. 

DESCRIPTION OF PREFERRED EMBODIMENTS 

niroughout the explanation of the embodiments herein, emphasis is given to natural audio source coding 
appUcations. However, it should be understood that the presem invention is applicable on a range of source coding 
appUcations other than that of encoding and decoding audio signals. 

Transposition basics 

TransposiUon as defined according to the present invention, is the ideal method for spectral repUcation, and has 
several major advantages over prior art, such as: no pitch detection is required, equally high performance for single- 
pitched and polyphonic programme material is obtained, and the transposiUon works equally well for tonal and non- 
tonal signals. Contrary to other methods, the transposition accoiding to the invenUon can be used in arbitrary audio 
source coding systems for arbitrary signal types. 

Ai, exact transpositionafactorMofadiscrete time signal *(«)in the form ofasum of cosines with time vary^^^ 
amplitudes, is defined by the relation 

i=Q ^ ' 



1=0 



(2) 



where N is the number of sinusoids, hereafter referred to as partial, e.(«). a. are the individual input frequencies 
ume envelopes and phase constants respectively, A are the arbitrary output phase constants and /, is the sampling ' 

frequency, and 0<iW/-<;;/2. 

In Fig. 2. the generation of JW* order harmonic, where M is an integer ^ 2. is shown. n.e tenn ^ order hannonics 
.sused for sirnpUdty.albeit the process generatesA/^order harmonics to 

which m most cases are themselves hannonics of unknown order. TTie input signal with the frequency domain 
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representation;r(/) is bandlimited to the range 0 to/^ 201. The signal contents in the range/_/A/to Qfn^iM 
where Q is the desired bandwidth expansion fartor 1< g ^ A/, is extracted by means of a bandpass filter, forming a 
bandpass signal with spectrum AsK/) 203. The bandpass signal is transposed a factor M, forming a second 
bandpass signal with spectrum X^f) covering the range/„„ to Qf^, 205. The spectral envelope of this signal is 
adjusted by means of a programme-controlled equaliser, forming a signal with spectrum Ai(/) 207. This signal is 
then combined with a delayed version of tite inpirt signal in order to compensate for the delay imposed by the 
bandpass filter and transposer. whereby an output signal with spectrum Y{f) covering the range 0 to Qf^ is formed 
209. Alternatively, bandpass filtering may be performed after the transposition M, using cut-off frequencics/„^ and 
Qf^. By using multiple transposers, simultaneous generation of different harmonic orders is of course possible. 
The above scheme may also be used to "fill in" stopbands wiUun the input signal, as shown in Fig. 3. where Uie 
input signal las a stopband extending from/o to e/&301. A passband \f,lhA,Qf,IM\ is then extracted 303, transposed 
a factor AY to [^ m 305, envelope adjusted 307 and combined with the delayed input signal forming the output 
signal with spectrum Yif) 309. 

An approximation of an exact transposition may be used. According to the present invention, the quality of such 
approximations is determined using dissonance theory. A criterion for dissonance is presented by Plomp ['Tonal 
Consonance and Critical BandwidU," R. Plomp. W. J. M. Levelt JASA , Vol 38, 1965], and states that two partials 
are considered dissonant if the frequency difference is within approximately 5 to 50% of the bandwidth of the 
critical band in which the partials are situated. For reference, the critical bandwidth for a given frequency can be 
approximated by 

c6(/) = 25 + 75(l+1.4(-^)')''*» 

1000 (3) 

with/and d, in Hz. Further, Plomp states that the hmuan auditory system can not discriminate two partials if they 
dififer in fi-equency by approximately less than five percent of the criUcal bandwidth in which they are situated. The 
exact transposition in Eq. 2 is approximated by 



1=0 



where A/; is the deviation fi-om the exact transposition. If the input partials form a harmonic series, a hypothesis of 
the invention states that the deviations from the harmonic series of the transposed partials must not exceed five 
percent of the critical bandwidth in which they are situated. TTus would explain why prior art methods give 
unsaUsfactory "harsh" and "rough" results, since broad band linear frequency shifts yields a much larger deviation 
than acceptable. When prior art methods produce more than one partial for only one input partial, the partials must 
nevertheless be within the above stated deviation limit, as to be perceived as one partial. TTiis again explains the 
poor resulU obtained with prior art methods using nonlinearities etc, since they produce intermodnlation partials not 
within the limit of deviation. 



When using the above transposition based method of spectral repUcation according to tl>e present invention, tl,e 
following important properties are achieved: 
- Normally, no frequency domain overlap occur between replicated harmonics and existing partials. 
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- Hie replicated partials are harmonicaUy related to the partials of the input signal and will not give rise to any 
annoying dissonance or artifacts. 

- The sp^tral eny^lppe % iSPJmdihOT19ni6S»fs)Hnsm*smooth continuation of the input signal spectral 
envelope, perceptually matching the original envelope. 

Transposition based on ti me-variant pnem search p rediMinn 

Various ways to design the required transposers exist. Typical time-domain implementations expand the signal in 
time by duplicating signal segments based on'ttie pitch^period. This signal is subsequently read out at a differem 
rate. Unfortunately such metiiods are stiicUy dependent on pitch-detection for accurate time splicing of the signal 
segments. Furtiieimore. tiie constraint to work on pitch-period based signal segments makes them sensitive to 
tiansients. Since Uie detected pitch-period can be much longer than tiie actual tiansient, tiie risk of duplicating tiie 
entire tiansient ratfier tiianjust expanding it in time is obvious. Anotiier type of time domain algoritimis obtains time 
expansion/compression of speech signals by utilising pattern search prediction of Uie output signal ["Pattern Search 
Prediction of Speech" R. Bogner, T Li, Proc. ICASSP '89, Vol. 1. May 1989, "Time-Scale Modification of Speech 
based on a nonlinear Oscillator Model" G. Kubin, W. B. Kleijn, EEE. 1994]. Hus is a form of gmnuiar synthesis 
where tfie input signal is divided into small parts, granules, used to synU,esise the output signal. Hus syntiiesis is ' 
usually done by performing correlation of signal segments in order to determine tiie best splicing points. ITus means 
0>at tiie segments used to form tfie output signal arc not dependent on flie pitch period, and Uius Uie non-trivial task 
of pitch detection is not required. Nevertiieless, problems wiUi rapidly changing signal amplitudes remain in tiiese 
metiiods, and high quahty transposition tends to raise high computational demands. However, an improved time- 
domain pitch shifter/transposer is now presented, where Oie use of transiem detection and dynamic system 
parameters produces a more accurate Uansposition for high transposition factors during both stationary (tonal or 
non-tonal) and transient sounds, at a low computational cost. 

Referring to tiie drawings wherein like nmnerals indicate like elements, there is shown in Fig. 4 nine separate 
modules: a transient^etector 401. a window position adjuster 403. a codebook generator 405. a synchronisation 
signal selector 407. a synchronisation position memory 409. a minimum difference estimator 41 1 an output 
segment memory 413. a mix miit 415. and a down sampler 417. Tlie input signal is fed to boUi tite codebook 
generator 405 and tiie tiansient-detector 40 1 . If a tiansient is detected, its position is sem to tiie window position 
module 403. This module stipulates tiie size and position of tiie window timt is multiplied witii tiie input signal when 
creatmg tiie codebook. The codebook generator 405 receives a synchronisation position from ttie sync select module 
407, provided it has been comiected to anotiier transposer. If tiiis synchronisation position is witiun tiie codebook it 
IS used and an output segment is produced. Otiienvise tiie codebook is sent to tiie minimum difference estimator 41 1 
which returns a new synchronisation positioa TTie new output segmem is windowed togetiier witii tiie previous 
output segment in tiie mix module 415 and subsequently down sampled in module 417. 

m order to clarify tiie explanation, a state space representation is intioduced. Here, tiie state vectors or granules 
represem tiie input and output signals. The input signal is represented by a statevector x(n): 

x{«) = [jc(«), x(n - D\ x(n - 2D) x^n -(N- 1)D)] (j) 
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which is obtained from delayed samples of the input signal, where is tlie dimension of the state vector and D is 
the delay between the input samples used to build the vector. The granular mapping yields the sample x(n) following 
each statevector x(n-l). This gives Eq. 6, where a(0 is the mapping: 

x(/7) = a{x(w-l)). (6) 

In the present method the granular mapping is used to determine the next output based on the former output, using a 
state transition codebook. The codebook of length L is continuously rebuilt containing tlie statevectors and the next 
sample following each statevector. Each statevector is separated from its neighbour by K samples; this enables Uie 
system to adjust the time resolution depending on the characteristics of tlie currently processed signal, where K 
equal to one represents the finest resoluUon. The input signal segment used to build Uie codebook is chosen based on 
the position of a possible transient and the synchronisation position in the previous codebook. 



This means that the mapping a(0, theoretically, is evaluated for aU transitions included in the codebook: 



15 



20 



25 



30 



35 



x(n-l) 



x(w-L+A:+1) 
x(n) 



(7) 



With this transiUon codebook, the new output y(n) is calculated by searching for Uic statevector in the codebook 
most similar to the current statevector y(n-l). This nearest-neighbour search is done by calculating the minimum 
difference and gives the new output sample: 

However, the system is not limited to work on a sample by sample basis, but is preferably operated on a segment by 
segment basis. The new output segment is windowed and added, mixed, with the previous output segment, and 
subsequently down sampled. The pitch transposition factor is determined by the ratio of the input segment length 
represented by the codebook and the output segment length read out of the codebook. 

Returning to the drawings, in Fig. 5 and Fig. 6 flowcharts are presented, displaying the cycle of operation of the 
transposer. In 501 the input data is represented, a transient detection 503 is performed on a segment of the input 
signal; the search for transients is performed on a segment length equal to the output segment length. If a transient is 
found 505, the position of the transient is stored 507 and the parameters L (representing the codebook length), K 
(representing the distance in samples between each statevector), and D (representing the delay between samples in 
each statevector) are adjusted 509. The position of the transient is compared to the position of the previous output 
segment 5 1 1, in order to determine whether the transient has been processed. If so 5 1 3, the position of the codebook 
(window LI and the parameters L, and D are adjusted 515. After the necessary parameter adjustments, based on 
the outcome of the transient detection, the search for a new synchronisation, or splicing point takes place 5 17. This 
procedure is displayed in Fig. 6. First a new synchronisation point is calculated based on the previous 601, 
according to: 

Sync j>os = Sync jposj>ld + SM - 5, (9) 

where Syncj)os and Syncj)os_old2iX^ the new and old synchronisation positions respectively, 5 is the length of tiie 
input segment being processed, and M is the transposition factor. This synchronisation point is used to compare the 
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accuracy of the new splicing point with the accuracy of the old splicing point 603. If the match is as good as or 
better than the previous 605, this new synchronisation point is returned 607 provided it is within the codebook. If 
. not. a new synchro^sation pom searchedfor in.the.loGp.609., This is performed wiara sirailanty measure, in this 
case a minimum difference function 611, however, it is also possible to use correlation in the time- or frequency- 
domain. If the position yields a better match than that of the previous position found 613 the synchronisation 
posiUon is stored 615. When all positions are tried 617 the system returns 619 to die flowchart in Fig. 5. Tlie new 
synchronisation point obtained is stored 519 and a new segment is read out from the codebook 521 starting at the 
given synchronisation point. This segmem is windowed and added to the previous 523, down sampled by the 
transposition factor 525, and stored in the output buffer 527. 

In Fig. 7 the behaviour of the system under transient conditions regarding the position of tlie codebook is illustrated. 
Prior to the transient, the codebook 1 representing the input segmem 1 is poshioned "to the left" of segment 1. 
Correlation segment 1 represents a part of the previous output and is used to find synchronisation point 1 in 
codebook 1. When the Uansient is detected, and the point of the transient is processed, the codebook is moved 
according to Fig. 7a and is staUonaiy until the input segmem currently being processed is once again "to the right" 
of the codebook. This makes it impossible to duplicate the transiem since the system is not allowed to search for 
synchronisation points prior to the transient. 

Most pitch transposers, or time expanders, based on pattern search prediction give satisfactory results for speech and 
single-pitched material. However, their performance deteriorates rapidly for high complexity signals, like music, in 
particular at large transposiUon factors. ITie present invention offers several solutions for improved performance 
therefore producing excellem results for any type of signal. Contrary to other designs, the system is time-variant and 
the system parameters are based on the properties of the input signal, and the parameters used during the previous 
operation cycle. The use of a transiem detector confrolling not only the codebook size and position, but also the 
properties of the stateveaors included, is a very robust and computationaUy efficient method to avoid audible 
degradation during rapidly changing signal segments. Furthermore. alteraUon of the length of the signal segment 
being processed, which would raise higher computational demands, is not required. Also, the present invention 
utihses a refined codebook search based on the results frem the preceding search. This means that contrary to an 
ordinary correlation of two signal segments, as is usually done in time-domain systems based on pattern search 
prediction, the most likely synchronisation positions are tried first instead of trying all positions consecutively This 
new method for reducing the codebook search drastically reduces the computational complexity of the system 
Further, when using seveml tiansposers, synchronisation position information can be shared among tiie tiansposeis 
for fiirther reduction of the computational complexity, as shown in the following implementation. 

Hie time-domain tiansposers as explained above are used to implemem tiie SBR- 1 and SBR-2 systems according to 
the foUowtng. illustrative but not limiting, example. In Fig. 8 three time expansion modules are used in order to 
generate second, tiurd and fourth order hannonics. Since, in this example, each time domain expansion /tiansposer 
works on a wideband signal, it is beneficial to adjust the spectral envelope of tiie source frequency range prior to 
transposition, considering that there wiU be no means to do so after tiie transpositions, without adding a separate 
equaliser system. TTxe spectial em^elope adjusters. 801. 803 and 805. each work on several filterbank channels The 
gam of each chamiel in die envelope adjusters must be set so that tiie smn. 813, 8 15. 817. at tiie output, after 
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transposition, yields the desired spectral envelope. The transposers 807, 809 and 811 are interconnected in order to 
share synchronisation position informaUon. This is based on the fact Uiat under certain conditions, a high correlation 
will occur between Uie synchronisation positions found in the codebook during correlation in the separate 
transposing units. Assume, as an example and again not limiting the scope of the invention, the fourth order 
5 harmonic transposer works on a time frame basis half of that of the second order harmonic transposer but at twice 
the duty cycle. Assume further, that the codebooks used for the two expanders are the same and that Uie 
synchronisation posiUons of the two time-domain expanders are labelled sync j)os4 and sync^os2, respectively. 
This yields the follovdng relation: 

sync _ posl = sync _ pos^ - ^ • 4 • ^ - sync _ offset , for «= 1 ,2,3 ,4 . , . , (10) 

10 where 

sync_offset = sync_pos^ ~ sync _ posl , for n=0, (i i) 

and S is the length of the input segment represented by the codebook. This is valid as long as neither of the 
synchronisation position pointers reaches the end of the codebook. During normal operaUon n is increased by one 
for each time-frame processed by the second order liarmonic transposer, and when the codebook end inevitably is 
15 reached, by either of the pointers, the counter n is set to /7=0, and sync _posl and sync ^os4 are computed 

individually. Similar results are obtained for the third order fiarmonic fiansposer when connected to the fourth order 
harmonic transposer. 

The above-presented use of several interconnected time-domain transposers, for the creation of higher order 
20 harmonics, introduces substantial computational reducuon. Furthermore, the proposed use of time-domain 

transposers in connection with a suitable filterbank, presents the opportunity to adjust the envelope of the created 
spectrum while maintaining the simplicity and low computational cost of a time domain transposer, since these, 
more or less, may be implemented using fixed point arithmeUc and solely additive/subtractive-operations. 

25 Other, illustrative but not limiting, examples of the present invention are: 

- the use of a time domain transposer within each subband in a subband filter bank, thus reducing the signal 
complexity for each transposer, 

- the use of a time domain transposer in combination with a frequency domain transposer, thus enabling tlie 
system to use different methods for transposition depending on the characteristics of the input signal being 

30 processed, 

- the use of a time domain transposer in a wideband speech codec, operating on for instance the residual signal 
obtained after linear prediction. 

It should be recognised that the method outlined above may be advantageously used for timescale modification only, 
by simply omitting the sample rale conversion. Further it is understood, that although the outlined method focuses 
35 on pitch transposing to a higher pitch, i.e. time expansion, die same principles apply when transposing to a lower 
pitch, i.e. time compression, as is obvious to those skilled in die art. 
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Filter bank based transp osition 

Various new and innovative filter bank based transposition techniques will now be described. The signal to be 
transppsed is di>gded,intp,a..ssnes of BP-.0Rsubband signals. TTie subband sigii^ls are th^n'ti^iisposed. exact or 
approximately, which is advantageously accomplished by a reconnection of analysis- and synthesis subbands 
hereinafter referred to as a "patch". Tlie method is first demonstrated using a Short Time Fourier Transform, STFT. 

The N-point STFT of a discrete-time signal x(n) is defined by 

00 

= J^x(p) h{n-p) e-""'", (12) 

p=-to 

where ^ = 0,1,...;^-1 and co* = 2nk/Nand /,(«) is a window. If U>e window satisfies the following conditions 

r/»(0) = l 

\hin) = 0 for n = +N,±2N,±3N,... ^^^^ 
an inverse transform exists and is given by 



k=0 



The direct transform may be interpreted as an analyser, see Fig. 9a, consisting of a bank of A^BP-filters with impulse 
responses /,(«)exp(,«,«) 901 followed by a bank of ^multipUers with carriers exp(-y^«) 903 which shift the BP- 
signals down to regions aromid 0 Hz, forming the^^analysis signals^,(«). Tlie window acts as a prototype LP-filter 
X^n) have small bandwidflis and are normaUy downsampled 905. Eq. 12 need Uius only be evaluated at « = r« 

where/? is flie decimation factor andristiie new time.varlable.X^«)can be recovered from;^*(r^) by upsampiin^ 
see Fig 9b, i.e. insertion of zeros 907 followed by LP-filtering 909. TTie inverse transform may be interpreted as a 
syntiiesiser consisting of a bank of multipliers with carriers (1/A0exp0 «*n) 91 1 Uiat shift the signals X^n) up to 
their ongmal frequencies, followed by stages 913, Fig. 9c. that add the contributions >.^„) from all chamiels. The 
STFI and ISTFT may be rearranged in order to use the DPT and IDFT. which makes Uie use of FFT algorithms 

possible ["Implementation oftiie Phase VocoderusingtheFastFourierTransform" MRPortnoff lEEEASSP 
Vol. 24, No. 3. 1976]. 

Fig. 9c shows a patch 915 for generation of second harmonics, M= 2. withyv= 32. For the sake of simplicity only 
Channels 0 through ,6 are shown. TTie centre frequency of BP 16 equals the Nyqvist frequency, channels 17 Uirough 
31 correspond to negative frequencies. The blocks denoted P 9 17 and the gain blocks 919 will be described later and 
should presenUy be considered shorted out. Tlie input signal is in this example bandlimited so that only channels 0 
through 7 contain signals. Analyser channels 8 Uuough 16 are thus empty and need not be mapped to the 
synthesiser. Analyser chamiels 0 ti^ough 7 are comiected to synthesiser chamiels 0 through 7, conesponding to an 
mput signal delay path. Analysis channels A: where 4.*. 7 are also connected to syntiiesis channels M=2 
which shift the signals to frequency regions at two times the centre-frequencies of BP filters k. Hence, the signal's 

are upshifted to their original ranges as well as transposed one oaave up. Toexplore the hann^^ 
tern, of r^l-valued filter responds and modulators thenegativ^ 

branch of Fig. 10a. Hence, the combined output of the remapping k-^Mk 1001 and N-k N-Mk 1003 where 4^ k 
^ must be evaluated 
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This yields 

y(n) = ~[x(n) * h{n)cos{Q},,n)]cos{{M - l)fi?;tn)) + 

2 (15) 

- -— [x(w) * /i(rt)sin(6);tn)]sin((M - 1)0) i^n) 
N 

where M= 2. Eq. 15 may be interpreted as a BP-filteiing of the input signal, followed by a linear frequency shift or 
Upper Side Band (USB) modulation, i.e. single side band modulation using tlie upper side band, see Fig. 10b, where 
5 1005 and 1007 form a Hilbert transformer, 1009 and 1011 are multipliers with cosine and sine carriers and 10 1 3 is a 
difference stage which selects the upper sideband. Clearly, such a multiband BP and SSB method may be 
implemented explicitly, i.e. without filterbank patching, in the time or frequency domain, allowing arbiuary 
selection of individual passbands and oscillator frequencies. 

10 According to Eq. 15, a sinusoid with the frequency Oi within the passband of analysis channel k yields a harmonic at 
the frequency Mtt\+(G:^ - Wk)- Hence the method, referred to as basic multiband transposition, only generates exact 
harmonics for input signals with frequencies cOi = av, where 4^k^7, However, if tlie number of filters is 
sufficiently large, the deviation from an exaa transposition is negligible, see Eq. 4. Furtlier, the transposition is 
made exact for quasi-stationary tonal signals of arbitrary frequencies by inserting the blocks denoted P 917 (Fig. 9c), 

1 5 provided every analysis channel contains maximum one partial. In this case Xk(rR) are complex exponentials with 
frequencies equal to the differences between the partial frequencies oji and the centre frequencies Wk of the analysis 
filters. To obtain the exact transposition A/, these firequencies must be increased by a factor A/, modifying the above 
frequency relationship to -> Mcok-^M(G)! - cok) = Mo>i, The frequencies ofXk(rR) are equal to the time derivatives 
of their respective unwrapped phase angles and may be estimated using first order differences of successive phase 

20 angles. The frequency estimates are multiplied by M and synthesis phase angles are calculated using tliose new 

frequencies. However, the same result, aside from a phase constant, is obtained in an simplified way by multiplying 
the analysis arguments by M directly, eliminating the need for frequency estimation. This is described in Fig. 1 1, 
representing the blocks 917. Thus Xk(fRl where 4 < /r < 7 in this example, are converted from rectangular to polar 
coordinates, illustrated by the blocks R P, 1101. The arguments are multiplied by A/ = 2 1 103 and the magnimdes 

25 are unaltered. The signals are then converted back to rectangular coordinates (P R) 1 105 forming the signals 
YMk(rR) and fed to synthesiser channels according to Fig. 9c. This improved multiband transposition method thus 
has two stages: The patch provides a coarse transposition, as in the basic method, and the phase-muhipliers provide 
fine frequency corrections. The above multiband transposition methods differ from traditional pitch shifting 
techniques using the STFT, where lookup-table oscillators are used for the synthesis or, when the ISTFT is used for 

30 the synthesis the signal is time-stretched and decimated, i.e. no patch is used. 

The harmonic patch of Fig. 9c is easily modified for other transposition factors than two. Fig, 12 shows a patch 1203 
for generation of 3'** order harmonics, where 1201 are the analysis channels and 1205 are the synthesis channels- 
Different harmonic orders may be created simultaneously as shown in Fig. 13, where 2™* and 3"* order harmonics are 
35 used. Fig. 14 illustrates a non-overlapping combination of 2"^ 3'** and 4*** order harmonics. The lowest possible 
harmonic number is used as high in frequency as possible. Above the upper limit of the destination range of 
harmonic A/, harmonic A/+1 is used. Fig. 15 demonstrates a method of mapping all synthesiser channels (A^ = 64, 
channels 0-32 shown). All highband channels with non prime-number indices are mapped according to the 
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following relation between source and destination channel number: k^est = Mc«„,« where A/ is the smallest integer >2 
that satisfies the condition that lies in the lowband and k^^t in the highband. Hence, no synthesiser channel 
receives signal from more than one analysis channel. Prime-number higliband cliannels may be mapped to ksource = 1 
or lowband channels Kaurce > 1 that yield good approximations of the above relation (Only non-prime number 
connections with A/= 2, 3, 4, 5 are shown in Fig. 15). < 

It is also possible to combine ampUtude and phase information from different analyser channels. The amplitude 
signals |r^r/?)| may be connected according to Fig. 16, whereas the phase signals aTg{X,irR)} are connected 
according to the principle of Fig. 16. In this way the lowband frequencies will still be transposed, whereby a 
periodic repetition of the source region envelope is generated instead of the stretched envelope that results from a 
transposition according to Eq. 2. Gating or odier means may be incorporated in order to avoid amplification of 
"empty" source channels. Fig. 17 iUustrates another application, the generation of sub-harmonics to a highpass 
filtered or bass limited signal by using connections from higher to lower subbands. When using the above 
transpositions it may be beneficial to employ adaptive switching of patch based on tlie characterisUcs of the signal. 

In the above description it was assumed that the highest frequency contained in Uie input signal was significantly 
lower than the Nyqvist frequency. Thus, it was possible to perform a bandwidth expansion without an increase in 
sample rate. This is however not always the case, why a preceding upsampUng may be necessary. When using filter 
bank methods for transposition, it is possible to integrate upsampling in the process. 

Most perceptual codecs employ maximally decimated filter banks in the time to frequency mapping ["Introduction 
to Perceptual Coding*' KL Brandenburg, AES, Collected Papers on Digital Audio Bitxate Reduction, 1996]. Fig. 18a 
shows the basic structure of a perceptual encoder system. The analysis filter bank 1 80 1 spUts the input signal into 
several subband signals. The subband samples are individually quantised 1803, using a reduced number of bits, 
where the number of quantization levels arc determined from a perceptual model 1807 which estimates Uie 
minimum masking threshold. The subband samples are normalised, coded with optional redundancy coding methods 
and combined with side information consisting of the normalisation factors, bit-allocation information and other 
codec specific data 1805, to form the serial bit stream. The bit stream is then stored or transmitted. In the decoder. 
Fig. 18b, the coded bitstream is demultiplexed 1809, decoded and the subband samples are re-quantised to the equal 
number of bits 18 1 1. A synthesis filter bank combines the subband samples in order to recreate the original signal 
1813. Implementations using maximally decimated filter banks will drastically reduce computaUonal costs. In the 
following descriptions, there is a focus on cosine modulated filter banks. It should be appreciated however, that the 
invention can be implemented using other types of filter banks or transforms, including filter bank interpretaUons of 
the wavelet transform, other non-equal bandwidth filter banks or transforms and multi-dimensional filter banks or 
transforms. 



In the iUuslrative, but not limiting, descriptions below it is assumed that an L-channel cosine modulated filter bank 
spUts the input signal x{n) into L subband signals. TTie generic structure of a maximally decimated filter bank is 
shown in Fig. 19. The analysis filters are denoted H^z) 1901, where ^ = 0, 1„ . LA, Hie subband signals v,{n) are 
maximaUy decimated 1903, each of sampling frequency/yz,, where/, is the sampling frequency of x(n). The 
synthesis section reassembles the subband signals after interpolaUon 1905 and filtering 1907 to produce x{n) . The 
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synthesis filters are denoted In addition, the present invention performs a spectral replication on x(n) . giving 
an enhanced signal X")- 

Synthesising the subband signals with a OL-channel filter bank, where only the L lowband channels are used and the 
bandwidth expansion factor Q is chosen so that QL is an integer value, will result in an output bit stream with 
sampling frequency Qf.. Hence, the extended filter bank will act as if it is an L-channel filter bank followed by an 
upsampler. Since, in this case, the L{Q.l) highband filters are unused (fed with zeros), the audio bandwidth will not 
change - the filter bank will merely reconstruct an upsampled version of x(«) . If, however, the L subband signals 
are patched to the highband filters, the bandwidth of i(/7) will be increased by a factor Q, producingK«). This is die 
maximaUy decimated filter bank version of the basic multiband transposer, according to the invention. Using this 
scheme, the upsampling process is integrated in the synthesis filtering as explained earlier. It should be noted tliat 
any size of the synthesis filter bank may be used, resulting in different sample-rates of the output signal, and hence 
different bandwidth expansion factors. Performing spectral replication oni(«) according to the present invention of 
the basic multiband transposition method witli an integer transposition factor^ is accomplished by patching U»e 
subband signals as 

(") = «A« («) (-1)^^-'^*" V, («) , (16) 

where k e [0,1-1] and chosen so ihalMk e IL,QL-1], ej^n) is the envelope correction and (-1)^ '^ is a correcUon 
factor for spectral inverted subbands. Spectral inversion results from decimation of subband signals, and the inverted 
signals may be reinverted by changing sign on every second sample in those channels. Referring to Fig. 20, consider 
an I6-channeI synthesis filter bank, patched 2009 for a transposition fectorM= 2, witli ^ = 2. The blocks 2001 and 
2003 denote the analysis filters H^z) and the decimators of Fig. 19 respectively. Similarly, 2005 and 2007 are the 
interpolators and synthesis filters F>^z). Eq. 16 then simplifies to patching of the four upper ftequency subband 
signals of the received data into eveiy second of the eight uppermost channels in the synthesis filter bank. Due to 
spectral inversion, every second patched subband signal must be frequency inverted before the synthesis. 
Additionally, tiie magnitudes of the patched signals must be adjusted 20 1 1 according to tiie principles of SBR-1 or 
SBR-2. 



iarem 
i m some 



Using the basic multiband transposition method according to the present invention, the generated harmonics i 
general not exact multiples of tiie fimdamentals. AH frequencies but the lowest in every subband differs ir 
extent from an exact transposition. Further, the replicated spectrum contains zeros since the target interval covers a 
wider frequency range than the souree interval. Moreover, Uie alias cancellation properties of the cosine modulated 
filter bank vanishes, since the subband signals are separated in fi^equency in the target interval. TTiat is, neighbouring 
subband signals do not overlap in tiie high-band area. However, aliasing reduction metiiods, known by tiiose skilled 
m the art, may be used to reduce tfus type of artifacts. Advantages of tiiis tiansposition metiiod are ease of 
implementation, and flie very low computational cost. 

To achieve perfect tiansposition of sinusoids, an effective maximally decimated filter bank solution of tiie improved 
multiband transposition metiiod is now presented. TTie system uses an additional modified analysis filter bank, while 
the syntiiesis filter bank is cosine modulated as described by Vaidyanatiian ("Multirate Systems and FUter Banks" 
P. P. Vaidyanatiian, Prentice Hall, Englewood Cliffs. New Jersey. 1993. ISBN 0-13-605718-7]. TTie steps for 
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operation, using the improved multiband transposition method according to the present invention, based 
maximaUy decimated fiher banks, are shown schematicaUy in Fig 2 1 and in tlie flow-chart of Fig. 22 and 



follows: 



on 
areas 



1 . The L received subband signals are synthesised with a G^-channel filter bank 210 1, 2201, 2203, where the 
L{QA) upper channels are fed with zeros, to form signal Mn). which is tiius oversampled by the bandwidth 
expansion factor Q. 

2. xi(n) is downsampied by a factor O, to form signal X2(n') 2103, 2205, i. e. X2{n-) = Xi{Qn'). 

3. An integer-value K is chosen as the size of a synthesis filter bank, constrained so that T-KM/Q is an integer, 
where 7 is the size of the modified analysis filter bank andM is the transposition factor 2207, 2209, 2211. 
should preferably be chosen large for stationaiy (tonal) signals, and smaller for dynamic (transient)'signals. 

4. X2{n') is filtered through a T-channel modified analysis filter bank 2107, 2213, where the 7" analysis filters are 
exponentially modulated, producing a set of complex-valued subband signals. The subband signals are 
downsampied by a factor T/M, giving subband signals v^n"), * = 0, 1,..., 7-1. Hence, the fiher bank vill 
be oversampled by a factorM 

5. The samples v^"") are converted to a polar representation (magnitude and phase-angle). The phase-angles 
are multiplied by the factor M, and the samples are converted back to a rectangular representation according 
to the scheme of Fig. 1 1. The real parts of the complex-valued samples are taken, giving the signals s^n") 
2109, 2215. After this operaUon, the signals st^(n") are critically sampled. 

6. nie gains of the signals s,<^(n") are adjusted according to the principles of SBR-1 or SBR-2 2111. 2217. 

7. 'n>esubbandsignals.A"'0,where*Mr/M,min(/:.7).l],aresynthesisedwithanordin^ 
modulated /:-chamiel filter bank, where the channels 0 through T/M-l are fed with zeros 2105, 2221. TTus 
produces the signal x^{n). 

8. x^in) is finally added to x.(/i) to give y(n) 2223, which is the desired spectral replicated signal. 

Steps 3 to 6 may be repeated for different values of the transposition factor thus adding muWple harmonics to 
x,(«). nus mode of operation is Ulustrated by the dotted figures of Fig. 21, and in Fig. 22, by iterating the loop over 
boxes 2211 - 2219. In this case. K is chosen as to make 7 integer-valued for all choices of M- for integer valued 
Ms; preferably select K as to make KIQ a posiUve integer. All subband signals. r'>(„-) , «here / = 1, 2,., .. m, and 
w IS the number of transposition factors, are added accordmg to 



(17) 



for every appUcable k. h. the fir« iteraUon of the loop of Fig. 22, the signals s^,') may be considered to be subband 
samples of zeros only, where = 0, 1 K-Um every loop, the new samples are added 2219 to s^n") as 



=j»(w')+4*^'\«'), 



(18) 



where k = KIQ, KIQ.I,..., min(i:,7;.)-l. Tl.e subband signals s^n'') are synthesised once with a i^-channel filter 
bank according to step 7. 
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The modified analysis filter bank of step 4, is derived through the theory of cosine modulated filter banks, where the 
modulated lapped transform (MLT) ("Lapped Transforms for Efficient Transfonn/Subband Coding" H. S. Malvar, 
IEEE Trans ASSP, vol. 38, no. 6, 1990] is a special case. Tlie impulse responses h^n) of the filters in a T-channel 
cosine modulated filter bank may be written 



(«) = Cpo ('»)cosr:^ {2k + 1)(/» -^) + d), 
llT 2 



(19) 



where A:-0. 1,.... M.A^ is the length of the lowpass prototype filter po(/J). C is a constant and <D» is a phase-angle 
that ensures alias cancellation between adjacent channels. The constraints on 0)* is 

<I>o = ±j . Or., = ± J and 0)^ = O,., ±| (20a-c) 
which may be simplified to the closed form expression 

<Dt=±(-l)*^- 



4 



(21) 



With this choice of <P* , perfect reconstruction systems or approximate reconstruction systems (pseudo QMF 
systems) may be obtained using synthesis filter banks with impulse responses as 



/*(") = C />o («) cosf^ {2k + l)(/i - ^) - CD, 
\_2T 2 



(22) 



Consider the filters 

(23) 



where /»'*(/») are sine-modulated versions of the prototype filter po(w). The filters //^(z) and H^z) have identical 
passband supports, but the phase responses differ. The passbands of the filters are actually Hilbert transforms of 
each other (this is not valid for frequencies dose to 0= 0 and oj= ti). Combining Eq. 19 and Eq. 23 according to 



a; (n) = h^ (n) + Jh'^ («) = Cpo (n) exp 



(24) 



yields filters that have the same shape of the magnitude responses as H^z) for posiUve frequencies but are zero for 
negative frequencies. Using a filter bank with impulse responses as in Eq. 24 gives a set of subband signals that may 
be interpreted as the analytic (complex) signals coiresponding to the subband signals obtained from a filter bank 
with impulse responses as m Eq. 19. Analytic signals are suitable for manipulation, since the complex-valued 
samples may be written in a polar form, that is z(/») = r(»)+y ,(«) = |z(«)|exp{/- arg(z(«))}. However, when using the 
25 complex filter bank for transposition, the constraint on <iV has to be generalised to retain the alias cancellation 
property. The new constraint on , to ensure alias cancellation in combination with a synUiesis filter bank with 
impulse responses as in Eq. 22 is 

<Dt = ±(-1)* — 

which simplifies to Eq. 21 whenA/= 1. With this choice, tiansposed partials will have the same relative phases as 
30 they would have when M=\ (no U-ansposition). 
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Combining Eq. 24 and Eq. 25 results in 



(2^+1) ,„ ^-1,,(-1)* 
L 4m 



which are the filters used in Uie modified filter bank of step 4, according to the present invention. 

Some clarifications concerning step 5: downsampling the complex-valued subband signals by a factor TIM makes 
them oversampled by M which is an essential criterion when the phase-angles subsequently are multiplied by the 
transposition factor M The oversampling forces the number of subband samples per bandwidth, after transposiUon 
to the target range, to equal that of the source range. Tlie individual bandwidths of the transposed subband signals 
are M times greater than those in the source range, due to the plase-multiplier. Tlus makes the subband signals 
critically sampled after step 5, and additionally, there wiU be no zeros in the spectnim when transposing tonal 
signals. 



In order to avoid trigonometric calculations, that is, having to compute the new subband signal 



s as 



4^)(«") = real. 



exp 



jM arctan 



'imag{v[^\,;")}' 



reaKv[^>(;/)} 



cos 



A/ arctan 



^mag{v[^\«0}^ 



I reaKvJ^^;,")} J 



(27) 



Where |vr(«")l is the absolute value of n^""), the following trigonometric relationship is used: 

cos(Wa) = cos^(a)-(f)sin2(a)cos^-2(a) + (f )sin4(a)cos^-^a)-.... (28) 

Letting 



« = arctan 



I, reaHv^**) («')}, 



(29) 



and noting that 



cos(a) = cos(arctan 



unag{vi^'(/i')} 



and 



sin(a) = sin(arctan 



reaKv[*^> (/,")} J 
imag{vf^5(w'')}^ 



) = 



real{vf^> (/,')} 

k*^'("')l 



(30) 



real{vf^>(/i")} j 



^_ imag{v<^)(w')} 
\>T\n'i 



(31) 



the computations of step 5 may be accomplished without trigonometric calculations, reducing computational 
complexity. 
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When using transpositions where A/ is even, obstacles with the phase-muhiplier may arise, depending on the 
characteristics of the lowpass prototype filter /7o(/i). AU applicable prototype filters have zeros on the unit circle in 
the z-plane. A zero on Uie unit circle imposes a 180° shift in the phase response of the filter. For M even, the phase- 
multiplier translates these shifts to 360*> shifts; i.e. the phase-shifts vanish. The partials so located in frequency that 
5 such phase-shifts vanish will give rise to aliasing in the synthesised signal. Tlic worst case scenario is when a partial 
is located at a point in frequency corresponding to the top of the furst side lobe of an analysis filter. Depending on 
the rejection of tliis lobe in the magnitude response, Oie aliasing will be more or less audible. As an example, the 
first side lobe of the prototype filter used in the ISO/MPEG layer 1 and 2 standard is rejected 96 dB, while the 
rejection is only 23 dB for the first side lobe of the sine-window used in tlie MDCT scheme of the ISO/MPEG layer 
10 3 standard. It is clear, that this type of aliasing, using the sine-window, will be audible. A solution to tliis problem 
will be presented, and is referred to as relative phase locking. 

The filters /(V) all have linear phase responses. Tlie phase-angles Ok introduce relative phase differences between 
adjacent channels, and the zeros on the unit circle introduce 180° phase-sliifts at locations in frequency that may 

1 5 differ between channels. By monitoring the phase-difference between neighbouring subband signals, before the 
phase-multiplier is activated, it is easy to detect the channels that contain phase-inverted information. Considering 
tonal signals, the phase-difference is approximately jdlM, according to Eq. 25, for non-inverted signals, and 
consequently approximately 7i{\-\l2M) for signals, where either of the signals is inverted. The detection of inverted 
signals may be accomplished simply by computing the dot product of samples in adjacent subbands as 

20 v^^hrt^W^^^in") = x^[vf'\n^)}x^z]{v'^^^(n^)}^^^^ . (32) 

If the product in Eq. 32 is negative, the phase-difference is greater than 90°, and a phase-inversion condition is 
present. The phase-angles of the complex-valued subband signals are multiplied by M, according to the scheme of 
step 5, and fmally, the inversion-tagged signals are negated. The relative phase locking method thus forces the 180° 
shifted subband signals to retain this shift after the phase-multiplication, and hence maintain the aliasing 
25 cancellation properties. 

Spectral envelope adjustment 

Most sounds, like speech and music, are characterised as products of slowly varying envelopes and rapidly varying 
carriers with constant amplitude, as described by Stockham ["The Application of Generalized Linearity to 
Automatic Gain Control" T.G. Stockham, Jr, IEEE Trans, on Audio and ElectroacousUcs, Vol. AU-16, No. 2, June 
1968] and Eq. 1, ' 

In split-band perceptual audio coders, the audio signal is segmented into fnmies and split into multiple frequency 
bands using subband filters or a time-to-frequency domain transform. In most codec types, the signal is 
subsequently separated into two major signal components for transmission or storage, the spectral envelope 
representation and the noimaliscd subband samples or coefficients. Througliout the following descripUon, the term 
"subband samples" or "coefficients" refers to sample values obtained from subband filters as well as coefficients 
obtained fi-om a time-to-frequency transform. The term "spectral envelope" or "scale factors" represent values of the 
subbands on a time-frame basis, such as the average or maximum magnitude in each subband, used for 
normalisaUon of the subband samples. However, the spectral envelope may also be obtained using linear prediction 
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LPC, [U.S. Pat. 5,684,920]. In a typical codec, Uie nomalised subband samples require coding at a high bitrate 
(using approximately 90% of the available bitrate). compared to the slowly varying temporal envelopes, and thus the 
spectral envelopes, that may be coded at.a much^reduced rate (using approximatdy-10% of the available bitrate). 

Accurate spectral envelope of the repUcated bandwidth is important if die timbial qualities of the original signal are 
to be preserved. Hie perceived timbre of a musical instrument, or voice, is mainly determined by the spectral 
distribution below a frequencyyij^ located in .the highest octaves of hearing. The spectral details above^fi^ are thus 
of less importance, and consequently the highband fme' structures obtained by the above transposition methods 
require no adjustment, while the coarse structures generally do. In order to enable such adjustment, it is usefiil to 
filter the spectral representation of the signal to separate the envelope coarse structure from the fme structure. 

In the SBR-1 impIememaUon according to the present invention, Uie highband coarse spectral envelope is estimated 
from the lowband information available at the decoder. TTiis estimation is performed by continuously monitoring the 
envelope of the lowband and adjusting the highband spectral envelope according to specific rules. A novel method 
to accomplish the envelope estimation uses asymptotes in a logarithmic frequency-magnitude space, which is 
equivalent to curve fitting with polynomials of varying order in the linear space. The level and slope of an upper 
portion of the lowband spectnim are esUmated, and the estimates are used to define the level and slope of one or 
several segments representing the new highband envelope. The asymptote intersecUomj are fixed in frequency and 
act as pivot points. However not always necessary, it is beneficial to stipulate constraints to keep the highband 
envelope excursions within reahsUc boundaries. An alternative approach to estimation of the spectral envelope is to 
use vector quantization, VQ, of a large nmnber of representative spectral envelopes, and store these in a lookup- 
table or codebook. Vector quantization is performed by training the desired number of vectors on a vast amount of 
training data, in tiiis case audio spectral envelopes. The training is usually done with the Generalised Lloyd 
Algorithm ["Vector Quantization and Signal Compression" A. Gersho, R. M. Gray, Kluwer Academic Publishers 
USA 1992, ISBN 0-7923-9181-0], and yields vectors that optimally cover Uie contems of the training data. 
Considering a VQ codebook consisting of A spectral envelopes uained by B envelopes (B » A), then the A 
envelopes represent the A most likely transitions from the lowband envelope to the highband envelope, based on B 
observations of a wide variety of sounds. This is, theoretically, the A optimum rules for predicting tiie envelope 
based on Uie B observations. When estimating a new highband spectral envelope, tiie original lowband envelope is 
used to search the codebook and the highband part of the best matching codebook entry is applied to create the new 
highband spectrum. 

In Fig. 23, flie nonnalised subband samples are represented by 2301 and Uie spectral envelopes are represented by 
Uie scalefactors 2305. For iUustiative purposes tiie transmission to decoder 2303 is shown in parallel form. In die 
SBR-2 metiiod Fig. 24, Uie spectral envelope information is generated and tiBnsmitted according to Fig. 23, whereby 
only Uie lowband subband samples are transmitted. Transmitted scalefactoi. Uius span Uie fiill frequency range 
while Uie subband samples only span a restiicted frequency range, excluding Uie highband. At flie decoder Uie 
lowband subband samples 2401 are transposed 2403 and combined wiUi Uie received highband spectial envelope 
mformauon 2405. In Uiis way Uie synUietic highband spectial envelope is identical to Uiat of ttie original, while 
mamtaining a significant bit rate reduction. 
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In some codecs, it is possible to transmit the scalefactors for the enUre spectral envelope while omitUng the 
highband subband samples, as shown in Fig. 24. Other codec standards stipulate that scalefactors and subband 
samples must cover the same frequency range, i.e. scale-factors cannot be transmitted if the subband samples are 
omitted. In such cases, there are several solutions; the highband spectral envelope information can be transmitted in 
separate frames, where the frames have their own headers and optional error protection, followed by the data. 
Regular decoders, not taking advantage of the present invention, will not recognise Uie headers and therefore discard 
the extra frames. In a second solution, tlie highband spectral envelope' information is transmitted as auxiliary data 
within the encoded bitstream. However, the available auxiliary data field must be large enough to hold the envelope 
information. In cases where none of the first two solutions are adaptable, a third solution, where the highband 
spectral envelope information is hidden as subband samples, may be appUed. Subband scalefactors cover a large 
dynamic range, typically exceeding 100 dB. It is thus possible to set an arbirary number of subband scalefactors. 
2505 in Fig. 25. to very low values, and to transmit the highband scalefactors "camouflaged" as subband samples, 
2501. This way of transmitting the highband scale factors to the decoder 2503 ensures compaUbility with the 
bitstream syntax. Hence, arbitrary data may be transmitted in this fashion. A related method exists where 
15 information is coded into the subband sample stream [U. S. Pat. 5,687,191). A fourth soluUon, Fig. 26. can be 
appUed when a coding system uses Huffman- or other redundancy coding 2603. The subband samples for Uie 
highband is then set to zero 2601 or a constant value as to achieve a high redundancy. 
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Transient response improvement.'; 



Transient related artifacts are common problems in audio codecs, and similar artifacts occur in the presem invention. 
In general, patching generates spectral "zeros" or notches, corresponding to time domain pre- and post-echoes, i.e. 
spurious transients before and after "true" transients. Albeit the P-blocks "fill in Uie zeros" for slowly varying tonal 
signals, the pre- and post-echoes remain. The improved mulUband method is intended to work on discrete sinusoids, 
where the number of sinusoids is restricted to one per subband. Transients or noise in a subband can be viewed as a 
25 large number of discrete sinusoids within that subband. Tliis generates intermodulation distortion. These artifacts are 
considered as additional quantization-noise sources connected to the rephcated highband channels during transient 
intervals. Traditional methods to avoid pre- and post-echo artifacts in perceptual audio coders, for example adaptive 
window switching, may hence be used to enhance the subjective quality of the improved multiband method. By 
using the transiem detection provided by the codec or a separate detector and reducing the number of channels under 
transient conditions the "quantization noise" is forced not to exceed the time-dependent masking threshold. A 
smaUer number of channels is used during transient passages whereas a larger is used during tonal passages. Such 
adaptive window switching is commonly used in codecs in order to trade frequency resolution for time resolution. 
Different methods may be used in appUcations where the filterbank size is fixed. One approach is to shape tiie 
"quantization noise" m time via linear prediction in the spectial domain. The transposition is then performed on Uie 
residual signal, which is the output of the linear prediction filter. Subsequently, an inverse prediction filter is applied 
to the original- and spectral replicated channels simultaneously. Another approach employs a compander system i.e. 
dynamic amplitude compression of the toansient signal prior to transposition or coding, and a complementary 
expansion after transposition. It is also possible to switch between transposition methods in a signal dependent 
manner, for example, a high resolution filterbank uansposition mefliod is used for stationary signals, and a time- 
40 variant pattern search prediction method is employed for transient signals. 
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Practical implenientations 

Using a standard signal-processor or a powerful PC, real-time operation of a SBR-enhanced codec is possible. The 
SBR enhanced codec may also be hard-coded on a custom chip. It may also be implemented in various kinds of 
systems for storage or transmission of signals, analogue or digital, using arbitrary codecs. Fig. 27 and Fig. 28. Tlie 
SBR-1 method may be integrated in a decoder or supplied as an add-on hardware or software post-processing 
module. The SBR-2 method needs additional modification of the encoder. In Fig. 27 the analogue input signal is fed 
to the AAD-converter 2701, forming a digital signal which is fed to die an arbitrary encoder 2703, where source 
coding is performed. The signal fed into the system may be of such a low-pass type that spectral bands within the 
auditory range already have been discarded, or spectral bands are discarded in the arbitrary encoder. Tlie resulting 
lowband signals are fed to the multiplexer 2705. forming a serial bitstream which is transmitted or stored 2707. Tbe 
de-multiplexer 2709 restores the signals and feeds them to an arbitrary decoder 27 11 . The spectral envelope 
information 2715 is estimated at the decoder 2713 and fed to the SBR-1 unit 2713 which transposes the lowband 
signal to a highband signal and creates an envelope adjusted wideband signal. Finally, the digital wideband signal is 
converted 2717 to an analogue output signal. 

The SBR-2 method needs additional modificaUon of the encoder. In Fig. 28 the analogue input signal is fed to the 
A/D-converter 2801. forming a digital signal which is fed to tlie an arbitrary encoder 2803, where source coding is 
performed. THe spectral envelope information is extracted 2805. TTie resulting signals, lowband subband samples or 
coefficients and wideband envelope information, are fed to the mulUplexer 2807, forming a serial bitsUeam which is 
transmitted or stored 2809. The de-multiplexer 2811 restores the signals, lowband subband samples or coefficients 
and wideband envelope information, and feeds them to an arbitrary decoder 2815. The spectral envelope 
information 2813 is fed from the di-multiplexer 281 1 to the SBR-2 unit 2817 which transposes the lowband signal 
to a highband signal and creates an envelope adjusted wideband signal. Finally, the digital wideband signal is 
converted 28 19 to an analogue output signal. 

When only very low bitratcs are available, (Internet and slow telephone modems, AM-broadcasting etc.) mono 
coding of the audio program material is unavoidable. In order to improve the perceived quality and make the 
programme more pleasant sounding, a simple "pseudo-stereo" generator. Fig. 29. is obtained by the introduction of a 
tapped delayline 2901. This may feed 10ms and 15ms delayed signals at approximately -6dB 2903 to each output 
chamiel in addiUon to the original mono signal 2905. The pseudo-stereo generator ofiFers a valuable perceptual 
improvement at a low computational cost. 



35 



TTie above-described embodiments are merely illustrative for the principles of the present invention for audio source 
coding improvement It is miderstood that modifications and variations of the arrangements and the details described 
herein wiU be apparcm to others skiUed in the art. It is the intent, therefore, to be limited only by the scope of the 
impending patent claims and not by the specific details presented by way of description and explanation of the 
embodiments herein. 



wo 98/57436 



23 



PCT/IB98/00893 



CLAIMS 

1. A method for enhancement of a source coding system where said source coding system comprises an encoder 
representing all operations performed prior to storage or transmission, and a decoder representing all operations 

5 performed after storage or transmission, characterised by: 

at said encoder, discarding frequency band(s) of an original signal, forming a first signal; 
at said decoder, by means of transposition performed on said first signal, replicating frequency band(s) of 
said original signal, forming a second signal; and 

combining said first and said second signal, forming an output signal; whereby reduced bitrate at a given 
10 perceptual quality or improved perceptual quality at a given bitrate is obtained. 

2. A method according to claim 1, characterised in that passband(s) of said second signal are set not to overlap or 
only partly overlap passband(s) of said first signal. 

15 3, A method according to claims 1-2, characterised in that spectral envelope adjustment is performed based on 
estimation of the spectral envelope of said discarded frequency band(s) of said original signal using said first signal. 

4. A method according to claims 1-2, characterised in that spectral Envelope adjustment is performed based on 
transmitted envelope information of said discarded frequency band(s) of said original signal. 

20 

5. A method according to claim 4, characterised in that said spectral envelope information is transmitted as 
subband samples in an arbitrary number of subband channels where tlie gains of said subband chaimels are set to a 
low level; whereby compatibility with standardised decoders is ensured. 

25 6. A method according to claim 4, characterised in that said envelope information is transmitted as scale factors 
without transmission of the corresponding subband samples. 

7. A method according to claim 4, characterised in that said envelope information is transmitted as scale factors 
and the corresponding subband samples are set to zero or a constant value; whereby the entropy of the subband 

30 samples is reduced. 

8. A method according to claims 1-7, characterised in that said output signal, when monophonic audio, is split 
into two signals each comprised of said output signal and delayed versions of the same to obtain a pseudo-stereo 
signal. 

35 

9. A method according to claims 1-7, where said transposition is characterised by; 

filtering a signal through a set of ^ 2 bandpass filters with passbands comprising tlie frequencies 
respectively, forming bandpass signals; 

shifting said bandpass signals in frequency to regions comprising the frequencies M[fu...,ff^] where A/ 1 is 
40 the transposition factor, and 

forming a transposed signal by combining said shifted bandpass signals. ' 
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10. A method according to claim 9, characterised in that said frequency shifting is obtained tlirough upper side 
band (USB) modulation. 

11. A method for transposition by a factor characterised by: 

bandpass fiUering a signal using an analysis filter bank or transfonn of such a nature that i^l- or complex- 
valued subband signals of lowpass type are generated; 

patching an arbitrary number of channels k of said analysis filter bank or transform to channels Mk,hd^\/m 
a synthesis filter bank or transform; and 

forming a transposed signal using said synthesis filter bank or transform. 

12. A method according to claim 11, characterised in that said filter bank is maximally decimated and said 
patching is performed according to tlie relation 

wher^ (-l)(^"i>^" is a correction factor, vi^n) tlie subband signal of channel K and v^,{n) the subband signal of 
channel A^; whereby compensation of spectral inverted subband signals is obtained. 

13. A method according to claims 11-12, characterised by: 

patching the phases of the subband signals from channels k of said analysis filter bank or transform as the 
phases of the subband signals associated with synthesis channels jW3t, A/^i 1; and 

patching the magnitudes of the subband signals from consecutive channels / of said analysis filter bank 
transform as the magnitudes of the subband signals associated with consecutive synthesis channels where 5 
an integer 1. 



or 
is 



14. A method according to claims 11-13, characterised in that the phases of said subband signals of said chamiels 
k are multiplied by said factor M before using said synthesis filter bank or transform. 

15. A method according to claims 11 - 14, characterised in that A/= a:*' , where AT is an integer > 1. 

16. A method according to claims 11 - 15, characterised in that said patching employs multiple values of said 

transposition factor M. 
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17. A method for transposition by a factor A/, characterised by: 

filtering a signal through a parallel bank ofL filters with impulse responses as 



where * - 0, 1 £-1. A: is a constant, and;>o('') is a lowpass prototype filter of length N, producing a set of Z, 

complex-valued signals; 

downsampling said L signals with a factor L/M, producing a set ofL complex-valued subband signals; 
multiplying the phase-angles of said complex-valued subband signals by M, giving a new set of subband 
signals; 

selecUng the real part of said new set of subband signals, resulting in a set of real-valued subband signals; 
upsampling a subset of said real-valued subband signals witli a factor L', producing a set of real-valued 
signals; 

filtering said real-valued signals through a parallel bank ofL' filters with impulse responses as 



fk(n) = K'p'o(n)cos 



where A: - 0, 1, . . Z'-l, ^ is a constant and p'o(n) is a lowpass prototype filter of length l\r, forming a set of I' 
filtered signals; and 

adding said U filtered signals to produce a transposed signal. 

18. A method according to claim 17, characterised in that said multiplication of said phase-angles and said 
selecting of the real part, is computed by: 

writing said complex-valued subband signals as 

where Ri^n) and liin) are the real and imaginary parts of Zkin) respectively; 
calculating said real-valued subband signals lVk{n) as 



W, in) = |Z, in)\ cos|m arctaij^^^ jj , 



where \Z^n)\ - sqrt{/?K/i)^+/^n)'} andMis a positive integer transposition factor, using the trigonometric identity 
cosiMa) = cos^ (a) - ) sin^ (a) cos^'^ ^ ^ ^.^ 4 (^) _ ^ , ^ 

where a = arctan{/jfc(w)//?jt(n)}, and the relations 

cos(«) = Ai^andsin(a)=T^; 

whereby reducing computational complexity by elimination of all trigonometric calculations. 
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19. A method according to claim 17, characterised by: 

on a block basis, extracting information conveyed by the phase-difference of an adjacent pair of said 
complex-valued subband signals; 

performing said multiplication of said phase-angles by M forming a pair of said new subband signals; and 
negating one of said new subband signals, on a condition provided by said information; whereby 180° phase- 
shifts of the subband signals are retained when employing an even integer-valued transposition factor M, 



20. A method according to claim 19, characterised in that said information is given by the dot-product of said 
complex-valued subband signals Ziln) and 2k+i{n) according to 

10 Z, {n) o Z,,i (n) = R, in)R,,, (n) + /, (n)!,^, (n) , 

where Ri(n) and /.(/i) are the real and imaginary parts of Zy(n) respectively, / = k+\, and one of said new subband 
signals is negated provided said dot-product is negative. 

21. A method for transposition, by expanding or compressing a first signal in time and duplicating or discarding 
1 5 arbitrary long segments of said first signal, subsequently down- or up-sample said first signal, characterised by: 

performing transient detection on said first signal; 

determining which segment of said fu^t signal to be used when duplicating or discarding parts of said first 
signal depending on the outcome of the transient detection; 

adjusting the length! of said signal segment depending on the outcome of the transient detection; 
20 adjusting the number of samples used for each statevector depending on the outcome of the transient 

detection; 

adjusting the delay D between samples in said statevector depending on the outcome of the transient 
detection; 

adjusting the number of samples K between each statevector depending on the outcome of the transient 
25 detection; and 

searching for synchronisation points in chosen segment of said first signal, based on die synchronisation 
point found in the previous synchronisation point search. 

22. A method according to claim 21, characterised in that several transposers are interconnected in order to share 
30 synchronisation point information; whereby reduced computational complexity is achieved, 

23. A method according to claims 21 - 22, characterised in that said transposers are comiected to a suitable filter 
bank, and the signals fed into each of said transposers are filtered as to obtain an arbitrary specUal envelope of the 
new signal being the sum of said signals being processed by said transposers. 
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24. A system for enhanced decoding of a source coded signal derived from an original signal, characterised by- 

transposition means for transposing frequency band(s) of said source coded signal, forming a first signal- 
estimation means operating on said source coded signal, for estimation of the spectral envelope of said 
original signal; 

adjusting means for adjusting the spectral envelope of said first signal, based on said estimation; 
combining means for combining said source coded signal and said adjusted fu-st signal; whereby reduced 
bitrate at a given perceptual quality or improved perceptual quality at a given bilrate is achieved- 

25. An apparatus according to claim 24, operating when said output signal is monophonic audio characterised by: 

delaying means for delaying and attenuating means for attenuating said output signal forming a first delayed 

signal; 

delaying means for delaying and attenuating means for attenuating said output signal, using different 

parameters, forming a second delayed signal; 

means for adding said output and said first delayed signal, forming a left-cliannel output signal; and 

means for adding said output and said second delayed signal, forming a riglit-channel output signal; whereby 

obtaining a pseudo stereophonic signal. 



26. A system for enhanced source coding where said system comprises an encoder representing all units preceding a 
storage media or transmission channel, and a decoder representing all units following said storage media or 
transmission channel, characterised by: 

discarding means at said encoder for discarding frequency band(s) of an original signal, forming a first 

signal; 

extracting means at said encoder for extracting spectral envelope information of said original signal, forming 
a second signal; 

means at said encoder for coding of said first and second signals; 

transposition means at said decoder for transposing ftequency tandCs) of said first signal, forming a third 

signal; 

adjusting means at said decoder for spectral envelope adjustment of said third signal, based on said second 
signal; and 

combining means at said decoder for combining said first and said adjusted third signal; whereby reduced 
bitrate at a given perceptual quaUty or improved perceptual quality at a given bitrate is achieved. 

27. An apparatus according to claim 26, operating when said output signal is monophonic audio characterised by: 

delaying means for delaying and attenuating means for attenuating said output signal forming a first delayed 

signal; 

delaying means for delaying and attenuating means for attenuating said output signal, using different 

parameters, forming a second delayed signal; 

means for adding said output and said first delayed signal, forming a left-channel output signal; and 

means for adding said output and said second delayed signal, forming a right-chamiel output signal; whereby 

obtaining a pseudo stereophonic signal. 
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28. An apparatus for transposition a factor A/, characterised by: 

filtering a signal by means of an analysis filter bank or transform of such a nature that real- or complex- 
valued subband signals of lowpass type are generated; 

means for patching an arbitrary number of channels k of said analysis filter bank or transform to channels 
Mk, M ^1,1X13. synthesis filter bank or transform; and 

forming a transposed signal by means of said synthesis filter bank or transform. 

29. An apparatus according to claim 28, characterised in that when A/= JKr^ , where A: is an integer > 1, the phases 
of the subband signals of said channels k are multiplied by M before using said synthesis filter bank or transform. 

30. An apparatus for transposition by a factor characterised by: 

fdtering means for filtering a signal through a parallel bank of L filters with impulse responses as 



hk in) = K Po(n)exp\j^(2k + l)(n - — ) + ; (-1)^ 



4M 

where k^O, 1,..., Z,-l, ATis a constant, is a lowpass prototype filter of length andMis the transposition 
factor, producing a set of L complex-valued signals; 

means for downsampling said L signals with a factor L/M, producing a set of I complex-valued subband 
signals; 

means for multiplying the phase-angles of said complex-valued subband signals by M, giving a new set of 
subband signals; 

means for selecting the real part of said new set of subband signals, resulting in a set of L real-valued 
subband signals; 

means for upsampling a subset of said real-valued subband signals with a factor L\ producing a set of real- 
valued signals; 

filtering means for fihering said real-valued signals through a paraUel bank of L' filters with impulse 
responses as 



l2L 2 4 



where /: - 0, 1, . . , , L'-l , is a constant and p'o(n) is a lowpass prototype filter of lengUi AT, forming a set of L' 
filtered signals; and 

means for adding said L' filtered signals to produce the transposed signal. 
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31. An apparatus for iransposition, by expanding or compressing a first signal in time by duplicating or discarding 
arbifiraiy long segments of said first signal, and subsequenUy down- or up-sample said first signal characterised 
by: 

detection means for performing transient detection on said first signal; 

means for using the position of a possible transient when determiiiing which segment of said first signal to be 
used when duplicating or discarding parts of said first signal, in order to obtain said transposition; 

adjusting means for adjusting the length (L) of said signal segment depending on the output from die 
transient detector; 

adjusting means for adjusting the number of samples (AO used for each statevector depending on die output 
from the transient detector; 

adjusting means for adjusting the delay (£)) between samples in said statevector depending on the output 
from the transient detector; 

adjusting means for adjusting die number of samples (K) between each statevector depending on die output 
from the tiansient detector; and 

searching means for searching for synchronisation points in chosen segment of said first signal, based on tiie 
synchronisation point found in the previous synchronisation point search. 

32. An apparatus according to claim 31, operating on subband signals characterised by: 

means for sharing synchronisation information between multiple instances of said tiansposer; 
means for forming subsets of said subband signals; 

means for amplitude adjustment of every channel within each of said subsets; 

synthesis filter bank means for forming, from each of said subsets, an input signal to each instance of said 
transposers; 

processing said input signals by said transposers; and 

summation means for acquiring a new signal by summation of said processed signals; whereby an arbitrary 
spectral envelope can be obtained. 



wo 98/57436 



PCT/IB98/00893 



1/13 




Fig, 2 



wo 98/57436 



PCT/IB98/00893 



2/13 



|X{OI 



|Xbp(OI 




1X^(01 



|Y{OI 



0 f 



305 



Of. 



307 



Fig, 3 



i 


i. 


\ 
\ 


\ 














H 




-—"SOS 


















c 


^ ^0 




k 




— 




Input ^ 
segment 



Extern ~ 
sync-info 



Transient 1 
Detector 1 




r r 


Win 
pos 


dow 1 
tion 1 



'401 
403 



409 



Sync-pos 
memory 



Codebook 
generator 



407 



Sync 
select 



1 



405 

i^odebook 



Sync-pos 



411 



Min-difference 
estimator 



415 



417 



Output 
memory 



413 




segment 



Fig. 4 



wo 98/57436 



PCT/IB98/00893 



3/13 



501 



Input J 
data m 



Transient 



detector 



505 




transient? >r-Yes-> 



Store point of 
transient, 
trans_pos 



507 



511 



Comparing the 
position of the 
^ previous output 
^ segment with the 
position of the 
transient. 



513 



517 





509 



YesH 



Adjust window (L) 
position, and size 
of L, K and D. 



Search for 
synchronisation 
point 



51! 



521- 



515 



Store 
synchronisation 
point 



i 



Read out new 
segment 



525 



527 



523 



^ Overlap add new 
\^ and previous 
segment using a 
window 



Down 
sample 




Output 
data 



Fig. 5 



wo 98/57436 



PCT/IB98/00893 



4/13 



Calculate new 
synchronisation 
point, based on the 
previous 



601 



val=sum(abs( 
input(sync_pos:sync_pos+N)- 
coclebook(sync_pos:syncjDos+N))) 



607 



^~~f return '^Yes 



611 




605 



615 



val=sum(abs(input(i:i+N)- ,. 
coclebook(i:i+N))) | 



sync_pos=i 
min_val=val 


pYes-< 

p 


^^^^^^;:■:■:v^^^^^: 




im 

617 












»^ 




No— ' 



return 



603 



Fig. 6 



wo 98/57436 



PCT/IB98/00893 



5/13 



input signal 



Synchronization , 

points Inputsegments 




Codebooks 1^ 1 2.3 ^[ 

Fig. 7a 

Synchronization 

pointe^ Comparation segments 

Output signal 

▲ 1 




Previous output r\..+r^..f« « . 

Output segments 
segment ^ 



(U 

c 
c 

(0 

JC 

o 
c 

■e 



3 

CO 



3 

I 



3 

CD 



Spectral 
envelope 
information 




815 

Wideband 
output 



Fig. 8 



wo 98/57436 



6/13 



PCT/1B98/00893 




919 

Fig. 9c 



wo 98/57436 



PCT/IB98/00893 



7/13 



hin)e 







n 


1 — ►( 








e 










n 


4— »/ 




1001 




1003 ^ 





N 

Fig. 10a 



x{n) 



—h(n)cosicoi,n) 



1009 




• () 



1005 



cos((M-l)tyjt/j)( E\ • ^^"^ 



— A(n)sin(<y^/j) 



1007 




1013 



sin((M - \)(Okn) 

Fig, 10b 



X,(rR) 



|Xk(rR)| 



^Si^iarg(X,(rR)) 



P-^R 



1101 



Fig. 11 



1103 1105 



0 7 1201 

PQQQQQOQ- — ^ 




1205 



000066660' 

0 



16 



Fig, 12 




wo 98/57436 



PCT/IB98/00893 



8/13 



QQOO 



OOOOOOOO 

0 




)0O000D0OODO0OO0t)0O0t)000O 

32 



4 




k. 


^ w 




II 




III 




1 — 


IV 


^ 



Fig, 14 




oooooooo< 

0 II — II— II— II 

HI III III III 

IV IV — 



Fig. IS 



IV IV 

V V 




32 



OOOOOOOO 



oooo 



OOOOgOQQQQQOOOOQOO 



0000000000066666666666666 

32 




Fig. 17 



wo 98/57436 



PCT/IB98/00893 



9/13 



1801 

PCM audio , subband 

■ " ^ ^ "^sample: 



1803 



bitstream 



analysis filter 
bank 




quantization 
and coding 



_j perceptual 
nnodel 




1807 

Fig. 18a 



1805 




bitstream 
multiplex 



encoded 
bitstream 



1809 

encoded 
bitstream / 




1811 

r 



dequantization 
and decoding 



1813 

subband ^ PCM audio 

sampleS|^ ^ ^ X bitstream 
synthesis filter \ 
bank 




Fig. 18b 



X(z) 



r 


1901 




1903 1905 

— ^ ' \. 


H„(2) 




\ L 


i v,(z) 


i 


i 










Hi(z) 




i. L 


1 V,(z) 






• 
1 

I 




1 
1 
1 






m 


IL 


i V,.,(z) 


% — ^ 


1 ► 

i 


mmmmm 









1907 



t L 



Fo(z) 



ill 



t L 



Fi(z) 



t L 



X(Z) 



Analysis bank 



Synthesis bank 



Fig. 19 



wo 98/57436 



PCT/IB98/00893 



10/13 



2001 2003 2005 2007 




Fig. 20 



2101 



2103 



2105 



L Received 
subband 
samples 



QL-channel 
filterbank/ 
transform 



r 



Dovvn- 
sampiing 



2107 



-channel 
modified 
filterbank/ 
transform 



2109 

Tj^channel' 
modified 
filterbank/ 




I transfonn ^ 



•1 
■■>i 



phase- 
multiplier 

2111 



phase- I 
multiplier 




i 



gain 
adjustment 



K-channel 
filterbank/ 
transform 



Fig. 21 



1 

-\j gam 
■-/) adjustment 

L I 



— > 



LJ 



wo 98/57436 



PCT/IB98/00893 



11/13 



2201 



2203 




2205 



2207 



2209 



2221 



2223 




Begin 




:;::i?..^::-C^::v:3:>--^ 



Choose expansion 
factor Q 



v.w.■.^^vx•:■^^ji^v:^■:v:->^^^>:❖:•^KW•:•^^^ 



Synthesize the received L 
subband signals, forming 
signal x, ' 

❖>»;w;^w>^>:.>x.>K^::S^.:.:.>>;«<.*.:^ 



Downsample a copy of x^ 
by the factor Q, forming 
signal x^ 





Choose 
transposition 
factor(s) M. > 



Choose a filter 
bank size K 



Produce the subband 
signals s^^ in a 
transposition-loop 



Synthesize the subband 
signals s^, forming signal Xj § 



Add signal x^ to signal Xj, 
forming the signal y 




2211 



For every i, compute \(\ 



Ti = (KMi)/Q 




1F#'2|213 



Analyze Xj with an Tj- 
channel modified filterbanl< f ' 



1 



^{215 



Multiply the phase-angles 
by Mj, and take the real 
value 



I 

: I 



Adjust the amplitude of the 
subband samples 



ll 



Add the subband samples 
to the samples s^ 



i 



■No- 



Every i processed? 



ra:->»:-:V;::S::>^:;ffK:v:: 



Yes 



Fig. 22 



wo 98/57436 



PCT/IB98/00893 



12/13 



15 



1 i 1 / 



I I I I i I I i I 



! I 




2305 



2301 



0000oooo6o66666cK — 
Fig, 23 



2303 




15 

□□□□□□□gDDPDDaap 




2601 



2603 



6666666606 
Fig. 26 



wo 98/57436 



PCT/IB98/00893 





2901 2903 2905 



In (M) 




Fig. 29 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
Incexnational Bureau 




PCX 

INTERNATIONAL APPLICATION PUBUSHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification ^ : 
H04B y66 



A3 



(11) International PubUcation Number: WO 98/57436 

(43) International Publication Date; 17 December 1998 (17.12.98) 



(21) International Application Number: PCT/IB98/00893 

(22) International Filing Date: 9 June 1998 (09.06.98) 



(30) Priority Data: 
9702213-1 
9704634-6 
9800268-6 



10 June 1997(10.06.97) SE 
12 December 1997 (12.12.97) SE 
30 January 1998 (30.01.98) SE 



(71)(72) Applicant and Inventor: LEJERYD, Lars, 
(SE/SE]; Vintervagen 19, S-171 34 Solna (SE). 



Gustaf 



(72) Inventors; and 

(75) Inventors/Applicants (for US only): EKSTRAND. Per, Rune, 
Albin [SE/SE]; Rensticmas Gata 29, S-116 31 Stockholm 
(SE). HENN. Lais, Frcdrik [CH/SE]; Ritarvagen 14, S-168 
31 Bromma (SE). KJORUNG. Hans, Magnus, Kristofer 
[SE/SE]; Vindhemsgatan 190, S-752 27 Uppsala (SE). 

(74) Common Representative: LHJERYD, Lars. Gustaf; Vinter- 
vagen 19. S-171 34 Solna (SE). 



(81) Designated States: AL, AM, AT. AU. AZ, BA, BB, EG BR 
BY. CA, CH, CN. CU, CZ. DE. DK, EE, ES. FI, Gb! Ge' 
GH, GM, GW. HU. ID, IL. IS. JP. KE, KG, KP. KR, KZ 
LC, LK, LR. LS, LT, LU, LV, MD, MG. MK, MN, MW 
MX, NO, NZ. PL, PT. RO. RU. SD, SE, SG, SI. SK, SL 
TJ, TM, TR, TT, UA, UG. US, UZ. VN. YU. ZW, ARIPO 
patent (GH, GM. KE, LS. MW. SD, SZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG. KZ, MD, RU, TJ. TM). European 
patent (AT, BE. CH. CY, DE, DK, ES, H, FR, GB, GR. 
IE, IT. LU, MC. NL. PT. SE), OAPI patent (BF. BJ, CF. 
CG, CI. CM, GA. GN. ML. MR, NE, SN. TD, TG). 



Published 

H^ith international search report. 

Before the expiration of the time limit for amending the claims 
and to be republished in the event of the receipt of amendments. 

(88) Date of publication of the international search report: 

10 February 2000 (10.02.00) 



(54) Title: SOURCE CODING ENHANCEMENT USING SPECIRAL-BAND REPLICATION 



103 107 




(57) Abstract 

The present invention proposes a new method and apparatus for the enhancement of source coding systems. The invention employs 
bandwidth reduction (101) prior to or in the encoder (103). followed by spectral-band replication (105) at the decoder (107). This is 
accomplished by the use of new transposition methods, in combination wifli spectral envelope adjustments. Reduced bitrate at a given 
perceptual quality or an improved perceptual quality at a given bitrate is offeied. The invention is preferably integrated in a hardware 
or software codec, but can also be implemented as a separate processor in combination with a codec. The invention offers substantial 
improvements practically independent of codec type and technological progress. 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovalda 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The fionner Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


krael 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Vict Nam 


CG 


Congo 


K£ 


Kenya 


NL 


Nethedands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


C6te d'lvoirc 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Rqniblic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazalutao 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Gennany 


U 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


BE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







INTERNATIONAL SEARCH REPORT 



International Application No 

r /IB 98/QG893 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 H04B1/66 



According to Intemationai Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation soarched (classification system followed by classification symbols) 

IPC 6 H04B 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category " Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



US 5 127 Q54 A (HONG DAEHYOUNG ET AL) 30 
June 1992 

cited in the application 
see figure FIGURE 
see column 1, line 41 - line 50 



1-4.9. 
11.24. 
26.28 



see column 1, 
see column 2, 

US 4 667 340 A 
May 1987 



line 65 
line 11 



column 
column 



2, 
3, 



1 ine 
1 ine 



2 

33 



(ARJMAND MASUD ET AL) 19 



see abstract; figures 4,5 

see column 3. line 8 - line 43 

see column 7, line 48 - column 8, line 22 



-/-• 



I- 4.24, 
26 

II- 14 



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



* Special categories of cited documents : 

*A* document defining the general state of the art which is not 
considered to be of particular relevance 

*E* earlier document but published on or after the intemationai 
filing date 

'L' document which may throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

*0' document referring to an oral disclosure, use, exhibition or 
other means 

*P* document published prior to the international filing date but 
later than the priority date claimed 



T" later document published after the intemationai filing date 
or priority date and not in conflict w(tti the application but 
cited to understand the principle or theory underiying the 
invention 

'X* document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
invoh/e an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed Invention 

cannot be considered to invo*ve an inventive atep when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skitled 
in the art. 

*&' document member of the same patent family 



Date of the actual completion of the International search 

26 April 1999 



Date of mailing of the intemationai search report 



2! 12. 1999 



Name and mailing sdc^ss of the ISA 

European Patent Office, P. B. 5818 Paten tlaan 2 
NL * 2280 HV Rijswi^ 
Tel. (+31-70) 340-2040. Tx. 31 651 epo nl. 
Fax: (+31-70)340-3016 



Authorized officer 



Sieben. S 



Form PCT/ISA/210 (tacond aneet) (Juty 1992) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 


International Application No 

F /IB 98/00893 


C.(Continualion) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category " 


Citation of docunwnt, with indication, where appropriate, of the relevant paaiages 


Relevant to claim No. 


A 


US 5 068 899 A (ELLIS JOHN G ET AL) 26 

November 1991 

see abstract; figures 1-7 

see column 1, line 39 - line 52 

see column 2, line 1 - line 36 

see column 2, line 64 - column 3, line 5 

see column 5, line 62 - column 6, line 14 




1-4,6, 
24,25 


A 


A J S FERREIRA: "Audio Spectral Coder" 
AES REPRINT 4201, 10OTH CONVENTION, 
COPENHAGEN, 

11 - 14 May 1995, XPO021OO534 
cited in the application 
see figure 3 

see page 5, paragraph 4.2; figures 2,4,5 
see page 7, paragraph 5; figure 4 




1-4.7, 
17,24, 
26,30 



Form PCT/ISA/ZIO (contmuaiion o* looond »h«rt} (July 1992) 



page 2 of 2 



INTERNATIONAL SEARCH REPORT 



.rnational application No. 

PCT/IB 98/00893 



Box t Observations where certain claims were found unsearchable (Continuation of Item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 1 7(2)(a) for the following reasons: 
1. I I Claims Noa,: 

because they relate to subject matter not required to be searched by this Authority, namely: 



□ 



Claims Nos.: ^ 

because they relate to parts of the international Applicatbn that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



3. aaimsNos.: 

because they are dependent claims and are not drafted In accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This Intemational Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 



1 . I I As aU required additional search fees were timely paid by the applicant, this International Search Report covers all 
' ' searchable claims. 

2. I I As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
' ' covers only those claims for which fees were paid, specifically claims Nos.; 



4. I Y I No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

1-20, 24-3G 



Remark on Protest | j The additional search fees were accompanied by the applicant's protest. 

I I No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet (1)) (Jufy 1998) 



International ApplicaUon Na PCT/IB 98/00893 



FURTHER INFORMATION CONTINUED FROM PCT/iSA/ £10 



1. Claims: 1-20.24-30 

Enhancement of a source coding system comprising an encoder 
discarding frequency bands, a decoder performing a 
transposition of frequency bands from a 1st signal to a 2nd 
signal and a combiner to generate an output signal from 1st 
and 2nd signal 



2. Claims: 21-23,31,32 

Method and apparatus for computationally efficient 
transposition by expanding or compressing a time signal 



INTERNATIONAL SEARCH REPORT 

Information on patent iamily members 



International Application No 

' 7IB 98/0G893 



Patent document 
cited in search report 


Publication 
date 


Patent family 
member(s) 


Publication 
date 


tic CI o*7nc A 


A 


30-05-1992 


NONE 






US 466734Q 


A 


19-05-1987 


EP 


0124728 


A 


14-11-1984 








JP 


1807228 


C 


10-12-1993 








JP 


5016599 


B 


04-03-1993 








JP 


60035799 


A 


23-02-1985 


US 5068899 


A 


26-11-1991 


CA 


■ 1220282 


A 


07-04-1987 



Form PCT/ISA/210 {patent tjniN annex} (July 1992) 



